首页> 外文OA文献 >Learning Spatial Regularization with Image-level Supervisions for Multi-label Image Classification
【2h】

Learning Spatial Regularization with Image-level Supervisions for Multi-label Image Classification

机译:用图像级监督学习空间正规化   多标签图像分类

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Multi-label image classification is a fundamental but challenging task incomputer vision. Great progress has been achieved by exploiting semanticrelations between labels in recent years. However, conventional approaches areunable to model the underlying spatial relations between labels in multi-labelimages, because spatial annotations of the labels are generally not provided.In this paper, we propose a unified deep neural network that exploits bothsemantic and spatial relations between labels with only image-levelsupervisions. Given a multi-label image, our proposed Spatial RegularizationNetwork (SRN) generates attention maps for all labels and captures theunderlying relations between them via learnable convolutions. By aggregatingthe regularized classification results with original results by a ResNet-101network, the classification performance can be consistently improved. The wholedeep neural network is trained end-to-end with only image-level annotations,thus requires no additional efforts on image annotations. Extensive evaluationson 3 public datasets with different types of labels show that our approachsignificantly outperforms state-of-the-arts and has strong generalizationcapability. Analysis of the learned SRN model demonstrates that it caneffectively capture both semantic and spatial relations of labels for improvingclassification performance.
机译:在计算机视觉中,多标签图像分类是一项基本但具有挑战性的任务。近年来,通过利用标签之间的语义关系已经取得了巨大的进步。但是,由于通常不提供标签的空间注释,因此传统方法无法对标签中的标签之间的潜在空间关系进行建模。本文提出了一种统一的深度神经网络,该网络仅利用标签之间的语义和空间关系图像级监督。给定一个多标签图像,我们提出的空间正则化网络(SRN)会为所有标签生成注意力图,并通过可学习的卷积捕获它们之间的潜在关系。通过使用ResNet-101网络将常规分类结果与原始结果进行汇总,可以持续提高分类性能。整个深度神经网络仅使用图像级注释进行端到端训练,因此无需在图像注释上进行额外的工作。在具有不同标签类型的3个公共数据集上进行的广泛评估表明,我们的方法明显优于最新技术,并且具有很强的概括能力。对学习到的SRN模型的分析表明,它可以有效地捕获标签的语义和空间关系,从而提高分类性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号